AITopics | cnn dm

cnn dm

9b6d7202750e8e32cd5270eb7fc131f7-Paper-Conference.pdf

Neural Information Processing Systems | Feb-11-2026, 00:06:11 GMT

information, summarization, summarization model, (17 more...)

Country:

South America > Ecuador (0.14)
North America > Costa Rica (0.14)
Europe > Belgium (0.04)
South America > Brazil (0.04)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Sports > Soccer (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

89d4402dc03d3b7318bbac10203034ab-Supplemental.pdf

Neural Information Processing Systems | Feb-9-2026, 18:25:34 GMT

attention distribution, beam search, global attention distribution, (14 more...)

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Alaska (0.04)
North America > United States > Minnesota (0.04)
(10 more...)

Industry:

Transportation > Air (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law (1.00)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

6fac9e316a4ae75ea244ddcef1982c71-Supplemental-Conference.pdf

Neural Information Processing Systems | Feb-9-2026, 17:06:19 GMT

consistency, efficiency gain, prediction, (16 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.49)

9b6d7202750e8e32cd5270eb7fc131f7-Paper-Conference.pdf

Neural Information Processing Systems | Oct-9-2025, 16:17:55 GMT

information, summarization, summarization model, (17 more...)

Country:

South America > Ecuador (0.14)
North America > Costa Rica (0.14)
Europe > Belgium (0.04)
South America > Brazil (0.04)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Sports > Soccer (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

89d4402dc03d3b7318bbac10203034ab-Supplemental.pdf

Neural Information Processing Systems | Aug-15-2025, 18:10:41 GMT

attention distribution, machine learning, natural language, (20 more...)

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Alaska (0.04)
North America > United States > Minnesota (0.04)
(11 more...)

Industry:

Transportation > Air (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

A Mathematical Details

Neural Information Processing Systems | Aug-15-2025, 17:37:54 GMT

We provide additional experimental results to supplement Section 6. In Section B.1, we include [...]. In Section B.2, we present a few example outputs with a visualization [...]. In Section B.3, we include results [...]. Figure B.1 and Figure B.2 present the empirical [...]. The bottom row presents the average number of decoder layers used. Figure B.5 presents two example outputs of CALM for instances from the machine translation, and [...]. We observe that the textual distance generally increases as we accelerate the decoding. Interestingly, following our initial intuition, CALM distributes the compute unevenly, using very few layers for certain "easy" tokens, and additional compute to "hard" tokens.

consistency, machine learning, natural language, (19 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.49)

DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding

Zarch, Hossein Entezari, Gao, Lei, Jiang, Chaoyi, Annavaram, Murali

arXiv.org Artificial Intelligence | Aug-8-2025

Speculative Decoding (SD) is a widely used approach to accelerate the inference of large language models (LLMs) without reducing generation quality. It operates by first using a compact model to draft multiple tokens efficiently, followed by parallel verification using the target LLM. This approach leads to faster inference compared to auto-regressive decoding. While there are multiple approaches to create a draft model, one promising approach is to use early-exit methods. These methods draft candidate tokens by using a subset of layers of the primary model and applying the remaining layers for verification, allowing a single model to handle both drafting and verification. While this technique reduces memory usage and computational cost, its performance relies on the choice of the exit layer for drafting and the number of tokens drafted (speculation length) in each SD round. Prior works use hyperparameter exploration to statically select these values. However, our evaluations show that these hyperparameter values are task-specific, and even within a task they are dependent on the current sequence context. We introduce DEL (Dynamic Exit Layer), a plug-and-play method that adaptively selects the exit layer and speculation length during inference. DEL dynamically tracks the token acceptance rate if the tokens are drafted at each layer of an LLM and uses that knowledge to heuristically select the optimal exit layer and speculation length. Our experiments across a broad range of models and downstream tasks show that DEL achieves overall speedups of $2.16\times$ to $2.62\times$ over vanilla auto-regressive decoding and improves upon state-of-the-art SD methods, which peak at $2.43\times$, by up to $0.19\times$. The code is available at https://github.com/hoenza/DEL.
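
The trade-off described in this abstract (a deeper exit layer drafts more accurately but costs more, and a longer speculation length pays off only when acceptance is high) can be illustrated with a small sketch. This is not DEL's actual heuristic, which the abstract does not specify. The scoring rule, the cost model, and the names `expected_tokens`, `round_cost`, and `select_exit_and_length` are all assumptions made for illustration.

```python
from itertools import product
from typing import Dict, Tuple


def expected_tokens(accept_rate: float, spec_len: int) -> float:
    """Expected tokens gained in one round: the accepted draft prefix plus one
    token from the verification pass. Assumes (hypothetically) that each draft
    token is accepted independently with probability accept_rate."""
    return 1.0 + sum(accept_rate ** i for i in range(1, spec_len + 1))


def round_cost(exit_layer: int, num_layers: int, spec_len: int) -> float:
    """Cost of one round in units of a full forward pass. Assumed cost model:
    spec_len shallow draft passes plus one full verification pass."""
    return spec_len * exit_layer / num_layers + 1.0


def select_exit_and_length(
    accept_rates: Dict[int, float], num_layers: int, max_spec_len: int
) -> Tuple[int, int]:
    """Return the (exit_layer, spec_len) pair with the highest expected
    tokens per unit cost, given tracked per-layer acceptance rates."""
    best_score, best_layer, best_len = float("-inf"), -1, -1
    for (layer, rate), k in product(accept_rates.items(), range(1, max_spec_len + 1)):
        score = expected_tokens(rate, k) / round_cost(layer, num_layers, k)
        if score > best_score:
            best_score, best_layer, best_len = score, layer, k
    return best_layer, best_len
```

In a running system the `accept_rates` dictionary would be refreshed as tokens are verified, so the selected pair can change with the sequence context.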

large language model, machine learning, natural language, (21 more...)

2504.05598

Country: North America > United States (1.00)

Genre: Research Report (1.00)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

BRIDO: Bringing Democratic Order to Abstractive Summarization

Lee, Junhyun, Goka, Harshith, Ko, Hyeonmok

arXiv.org Artificial Intelligence | Feb-25-2025

Hallucination refers to the inaccurate, irrelevant, and inconsistent text generated from large language models (LLMs). While the LLMs have shown great promise in a variety of tasks, the issue of hallucination still remains a major challenge for many practical uses. In this paper, we tackle the issue of hallucination in abstract text summarization by mitigating exposure bias. Existing models targeted for exposure bias mitigation, namely BRIO, aim for better summarization quality in the ROUGE score. We propose a model that uses a similar exposure bias mitigation strategy but with a goal that is aligned with less hallucination. We conjecture that among a group of candidate outputs, ones with hallucinations will comprise the minority of the whole group. That is, candidates with less similarity with others will have a higher chance of containing hallucinated content. Our method uses this aspect and utilizes contrastive learning, incentivizing candidates with high inter-candidate ROUGE scores. We performed experiments on the XSum and CNN/DM summarization datasets, and our method showed 6.25% and 3.82% improvement, respectively, on the consistency G-Eval score over BRIO.
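
The idea of scoring each candidate by its agreement with the others can be sketched as follows. This is a minimal illustration, not the BRIDO implementation: it substitutes a simple whitespace-tokenized unigram F1 for a full ROUGE score, it covers only the ranking step (not the contrastive training), and the names `unigram_f1` and `rank_by_agreement` are hypothetical.

```python
from collections import Counter
from typing import List, Tuple


def unigram_f1(a: str, b: str) -> float:
    """ROUGE-1-style F1 between two texts, using lowercase whitespace tokens
    (a simplification standing in for a real ROUGE implementation)."""
    tokens_a, tokens_b = Counter(a.lower().split()), Counter(b.lower().split())
    overlap = sum((tokens_a & tokens_b).values())  # clipped unigram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(tokens_b.values())
    recall = overlap / sum(tokens_a.values())
    return 2 * precision * recall / (precision + recall)


def rank_by_agreement(candidates: List[str]) -> List[Tuple[str, float]]:
    """Score each candidate by its mean similarity to all other candidates
    and return (candidate, score) pairs, most agreed-upon first."""
    scores = []
    for i, cand in enumerate(candidates):
        others = [unigram_f1(cand, o) for j, o in enumerate(candidates) if j != i]
        scores.append(sum(others) / len(others) if others else 0.0)
    order = sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
    return [(candidates[i], scores[i]) for i in order]
```

Under the abstract's conjecture, candidates at the bottom of this ranking are the ones more likely to contain hallucinated content, so a contrastive objective would push the model toward the candidates at the top.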

brio, hallucination, reference summary, (16 more...)

2502.18342

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
(4 more...)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding

Zhao, Weilin, Huang, Yuxiang, Han, Xu, Xu, Wang, Xiao, Chaojun, Zhang, Xinrong, Fang, Yewei, Zhang, Kaihuo, Liu, Zhiyuan, Sun, Maosong

arXiv.org Artificial Intelligence | Jun-26-2024

Speculative decoding is a widely used method that accelerates the generation process of large language models (LLMs) with no compromise in model performance. It achieves this goal by using an existing smaller model for drafting and then employing the target LLM to verify the draft in a low-cost parallel manner. Under such a drafting-verification framework, drafting efficiency has become a bottleneck in the final speedup of speculative decoding. Therefore, generating longer drafts at less cost can lead to better decoding speedup. To achieve this, we introduce Ouroboros, which can generate draft phrases to parallelize the drafting process and meanwhile lengthen drafts in a training-free manner. The experimental results on various typical text generation tasks show that Ouroboros can achieve speedups of up to $2.4\times$ over speculative decoding and $3.9\times$ over vanilla decoding, without fine-tuning draft and target models.
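
The drafting-verification framework that this abstract builds on can be sketched with toy models. The sketch shows only the basic greedy draft-then-verify loop, not Ouroboros's phrase-level drafting. The "models" are plain functions mapping a token sequence to the next token, verification is looped here where a real system would use one parallel target pass, and all names (`speculative_decode`, `greedy_decode`, `toy_target`, `toy_draft`) are hypothetical.

```python
from typing import Callable, List

NextToken = Callable[[List[int]], int]


def greedy_decode(target_next: NextToken, prompt: List[int], max_new_tokens: int) -> List[int]:
    """Plain auto-regressive decoding: one target call per generated token."""
    seq = list(prompt)
    for _ in range(max_new_tokens):
        seq.append(target_next(seq))
    return seq


def speculative_decode(target_next: NextToken, draft_next: NextToken,
                       prompt: List[int], max_new_tokens: int,
                       draft_len: int = 4) -> List[int]:
    """Greedy speculative decoding: draft draft_len tokens cheaply, keep the
    longest prefix the target agrees with, then add one target token."""
    seq = list(prompt)
    goal = len(prompt) + max_new_tokens
    while len(seq) < goal:
        # Draft phase: the small model proposes draft_len tokens.
        draft: List[int] = []
        for _ in range(draft_len):
            draft.append(draft_next(seq + draft))
        # Verify phase: looped here; a real system checks all positions
        # in a single parallel forward pass of the target model.
        accepted: List[int] = []
        for tok in draft:
            expected = target_next(seq + accepted)
            if tok != expected:
                accepted.append(expected)  # replace the first mismatch
                break
            accepted.append(tok)
        else:
            accepted.append(target_next(seq + accepted))  # bonus token
        seq.extend(accepted)
    return seq[:goal]


def toy_target(seq: List[int]) -> int:
    """Toy target model: counts upward modulo 10."""
    return (seq[-1] + 1) % 10


def toy_draft(seq: List[int]) -> int:
    """Toy draft model: agrees with the target except after tokens 3 and 7."""
    return (seq[-1] + 1) % 10 if seq[-1] % 4 != 3 else 0
```

Because every kept token is one the target would have produced itself, `speculative_decode(toy_target, toy_draft, [0], 12)` returns the same sequence as `greedy_decode(toy_target, [0], 12)`. The speedup in practice comes from accepting several draft tokens per target pass, which is why longer and cheaper drafts matter.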

arxiv preprint arxiv, ouroboro, target model, (14 more...)

2402.1372

Country: